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Spiking the random matrix hard edge 

Jose A. Ramirez* Brian Rider^ 


Abstract 

We characterize the limiting smallest eigenvalue distributions (or hard edge laws) for 
sample covariance type matrices drawn from a spiked population. In the case of a single 
spike, the results are valid in the context of the general /3 ensembles. For multiple spikes, 
the necessary construction restricts matters to real, complex or quaternion (j3 = 1,2, 
or 4) ensembles. The limit laws are described in terms of a random integral operators, 
and partial differential equations satisfied by the corresponding distribution functions 
are derived as corollaries. We also show that, under a natural limit, all spiked hard 
edge laws derived here degenerate to the critically spiked soft edge laws (or deformed 
Tracy-Widom laws). The latter were first described at /3 = 2 by Baik, Ben Arous, 
and Peche [2, and from a unified /3 random operator point of view by Bloemendal and 
Virag PH]. 


1 Introduction 

A basic problem in mathematical statistics is to describe the law of the largest eigenvalue 
of sample covariance (or Wishart) matrices of the form XEAU in which A is n x m and 
comprised independent unit Gaussians in F = M, C, or HI and f is the associated conjugate 
transpose. One is typically interested in the limit as n and m tend to oo with E some 
deterministic sequence of symmetric population matrices. When E is the identity, this so- 
called soft-edge limit is well known to be given by the /3 = 1,2 or 4 Tracy-Widom laws 
[27 , [28] (for the case of K, C, or HI entries respectively). Moving toward the more general 
problem, the spiked ensembles introduced by Johnstone in which E = E r © I m - r and r 
remains fixed as n and m grow have generated considerable interest. Here the subscripts 
specify the dimensions of the diagonal matrix E r and identity matrix J m _ r . 

Using the determinantal framework at /3 — 2, [2] proved there exists a phase transition 
(with respect to the entries of E r ). Below criticality one sees Tracy-Widom in the limit, above 
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criticality there are Gaussian effects (the limit given by the law of the largest eigenvalue of 
an r x r GUE), with a new one parameter family of spiked soft-edge laws in the crossover 
regime. Subsequent analytic work was carried at (3 = 1 and /3 = 4 in [15] and |31j . 

In another direction, Bloemendal-Virag 00 proved that the j3 = 1,2, or 4 soft-edge 
(critically) spiked laws can be characterized in a unified way through the eigenvalue problem 
for the random operator B acting on functions / £ L 2 [[0, oo),F r ] defined by 

n = -~ + rx + V2B' x , rn = WHO). (1.1) 

Here B t is the standard F-invariant Brownian motion, that is, for U £ C/ r (F) = {V £ F rxr : 
VV^ — 1} it holds that UB t W ~ B t (equality in law), and W is the appropriate scaling limit 
of the matrix E r . At r = 1 the result holds for all (3 > 0. In that case the noise term reduces 
to for x i —y b x a standard Brownian motion, and B is recognized as the Stochastic Airy 
Operator from [T9] , but with a simple Robin /'(0) = wf( 0) rather than Dirichlet boundary 
condition at the origin. Hence, for r = 1 the single parameter c interpolates between the 
general beta Tracy-Wiclom law at w = +oo (which forces /(0) = 0) and the Gaussian law 
in the supercritical regime, w = —oo (which requires a little perturbation argument to make 
precise, see 0)- 

The results of 00 follow upon those of [19] in that they hinge on tridiagonal represen¬ 
tations for the spiked Wishart ensembles. For r = 1 these are the same random tridiagonal 
matrices introduced by Dumitriu-Edelman cm, which subsequently formed the basis for the 
conjectured Stochastic Airy Operator formulation of the Tracy-Widom(/3) laws in [11, [26] 
(and again proved in [19]). In particular consider the XE 1 / 2 for X as above (comprised of 
iid real/complex/quaternion Gaussians) but specify E = a © I m -i for a scalar a. Then the 
classical Householder procedure can be carried out, taking XE 1 / 2 into 


B 


a 


l 

7 ? 


'/&Xml3 X(n-l)/3 

X(m— 1)/3 X(n—2)/3 


X(m—n-\-2)f3 X/3 

X(m—n+l)/3 


( 1 . 2 ) 


up to a unitary conjugation. That is, B a Bj has the same eigenvalues as XEX^ in the 
real/complex/quaternion (/3 = 1/2/4) cases, and afterwards this ensemble may be declared 
the general one-spiked model. For r > 1, one of the key contributions of 0] is a block tridi- 
agonalization procedure (which we revisit in Section 3) that commutes with r-fold spiking. 
Unfortunately, this model does not immediately generalize to all (3 > 0. 

Our goal here is to establish an analogous set of results for the critically spiked “hard 
edge”. The hard edge refers to the limit laws for the minimal eigenvalues of Wishart type 
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ensembles in the regime m = n + a (which forces a non-trivial interaction of the random spec¬ 
trum with the origin). Classically - that is, for /3 — 1, 2,4 - these laws were also identified by 
Tracy and Widom in terms of Painleve functions [28]. They come indexed by the parameter 
a and degenerate to the more well known Tracy-Widom (/3 = 1, 2,4) laws of [271 [29] in the 
limit a —> oo. This hard-to-soft transition in the classical setting appears to have first been 
worked out in [3j. For general ft > 0 we introduced a random operator for the hard edge in 
[IS] and demonstrated that, as expected, its ordered eigenvalues went over into those for the 
Stochastic Airy Operator for all f3. 

To critically spike the hard edge the entries of E r must be tuned to zero as n —> oo. We 
show that if nT, r has an appropriate limit then there is a new family of random operator 
limits indexed by these spiking parameters. These spiked hard edge operators produce 
diffusion/PDE descriptions for the new limit laws (in the manner familiar from [3HTHT8. fl9j). 
and, again as expected, recapture all known spiked Tracy-Widom laws in the a —> oo limit. 

We mention that for /3 = 2 and at the level of correlation functions the hard edge spiking 
was worked out in [8]. 

The next few articles of the introduction detail: (i) the limiting spiked hard edge op¬ 
erators, (ii) diffusion and partial differential equation descriptions of the corresponding 
spiked hard edge laws, (iii) the transition between the spiked hard edge laws and the 
spiked/deformed Tracy-Widom laws (for all fi at r = 1, and for j3 = 1,2,4 otherwise), 
and last (iv) a description of the supercritical regime for the hard edge via a recent matrix 
generalization of the Dufresne identity. 

Operator limits 

While structurally similar we state the limiting operator characterizations separately for 
r = 1 and r > 1. In addition to the restriction of the multispike limit operator to fi = 1,2, 
or 4, the r = 1 operator has a secondary diffusion description not available for r > 1. 

We denote the r-spiked Wishart ensemble by W n> s r , or simply W ny(J when r = 1 (in which 
case it has the representation B a Bfi with B a as in (ll.2j) and m = n + a). 

Theorem 1. Given a standard Brownian motion x t —> b(x), define: for any a > — 1 and 

P > 

m(Gfe) = e ( ' a+1 - )3; GP^dx, s(dx) = e ax+ dx, (1.3) 

along with the (random) integral operator 

noo rx/\ y -i noo 

(0/)(z)=/ / s(dz)f(y)m(dy) + - / f(y)m(dy). (1.4) 

Jo Jo c Jo 

For c G (0,oo], C5 = ®/ 3 , a ,c is almost surely compact on L 2 (M + ,m) with inverse eigenvalues 
0 < A 0 < Ai < • • • corresponding to the problem f = A(5f. Further, as n —$■ oo with 
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ncr —» c, the ordered eigenvalues of the scaled one-spiked matrix ensemble nW nt(y converge to 
the collection {A*,} in the sense of finite dimensional distributions. 


What is attractive is that 0 is an explicit, rank one perturbation of the hard-edge limit 
operator found in [18] for the unspiked case — which is recovered by setting c = +oo in (II.4p . 


Just as in that case, pretending that everything in sight can be differentiated at will, 0 is 
seen to represent the Green’s operator for the generatoi0 


£ = e* 


d 1 2 . 2 .. d 

—— — (a H- -=h (x)) — 

dx 2 yT? dx 


(1.5) 


or £ = — 0 _1 . This can be made precise by recognizing m(dx) and f x s(dz ) as the (random) 
speed measure and scale function for the associated diffusion 1 H>■ X t . In particular, £ = ^ jj. 

Fully specifying £ requires assigning a boundary condition at the origin. It is easy to 
see that any solution / E L 2 [m] of f{x) = A0/(x) is actually C 3 / 2 ~. Taking one derivative 
throughout this eigenvalue/eigenfunction identity produces 

poo 

/'(0) = A / f(y)m(dy) 

Jo 

upon evaluating both sides at zero. And as we also have that /(0) = £ / 0 °° f(y)m(dy), we 
conclude that 

/'(0) = c/( 0), (1.6) 


for any L 2 eigenfunction. That is to say, just as in the case soft-edge spiking there is a Robin 
boundary condition at the origin of the (differential) operator limit. 

From a probabilistic point of view this boundary condition entails adjoining a killing 
measure cSo(x) to the speed-scale description. Pathwise, X t is constructed from the more 
familiar process t t—)■ X° with the same speed and scale, but with simple reflection at the 
origin. With L t the local time of X° at the origin, X t is defined to agree with those of X° up 
to the killing time r defined by the conditional distributions P(r > t \ X°) = e~ cLt , at which 
point the path is absorbed. From this standpoint c = +oo denotes instantaneous killing, 
which is another way to recognize that the standard hard edge diffusion is absorbed on its 
first passage to the origin. In all cases, the behavior at infinity can be read off from the 
speed and scale and depends on whether a > 0 or a < 0. For a > 0, +oo is an entrance, but 
not exit, point for the process. For a G (—1, 0), +oo is both entrance and exit. In the latter 
case, it is appropriate to view the process as reflected at infinity. Detailed background on 
this sort of construction and boundary classification can be found in Chapter 4 of 

Moving to r > 1, the limit is still a random integral operator, but now with an r x r 
matrix kernel, acting on the appropriate L 2 space of vector valued functions. Again, as we 

1 In [T8] the generator is denoted by —0, with 0 -1 reserved for the Green’s operator. This should’t cause 

confusion here. 
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use the block tridiagonalization procedure of Bloemendal-Virag, the operator itself has no 
natural extension off of /3 = 1, 2, or 4. 

Denote by F the field of real, complex, or quaternion numbers. By the corresponding 
(3 = 1,2, or 4 Brownian motion we mean the r x r matrix of independent unit F-valued 
Brownian motions at all off-diagonals entries, and independent real Brownian motions with 
diffusion coefficient 4 along the diagonal. 


Theorem 2. Take r > 1 and f3 = 1, 2, or 4. Define the r x r matrix processes: 


dA x A x dB x T ( ^ 2,(3^A x dx , Aq /, 

with the appropriate (3 = 1, 2,4 matrix Brownian motion x i—>- B x . Set 

m , = rM,4. 5, = (A4r‘ 


(1.7) 


( 1 , 8 ) 


Then, if (nS r ) 1 converges in norm to a non-negative definite C 1 , the limiting r-spiked 
hard edge operator 0 r = & r ,/3,a,c reads 

/ oo / pxA y \ poo 

ij S z dz \ M. y f (y)dy + C~ x J M y f{y)dy, (1.9) 

and is positive compact on L 2 [M.\ = {/ G F r : J 0 °° /f (x)M. x f(x)dx < oo} to itself. In 
particular, the ordered eigenvalues ofnW n ^ converge (again in the sense of finite dimensional 
distribution) to the L 2 -spectrum A rj0 < A r j < • • • for the problem f = A & r f. 

Putting r = 1 in the equations (1 1. 7[) and (11.8p which define the matrix kernel, both A4 X 
and S x reduce to the speed and scale (m (dx)/dx,s(dx)/dx) functions from (11.3(1 . as they 
must. Further, the eigenfunctions of 0. r will again be C 3 ^ 2 ~ entry-wise, and the inherited 
boundary condition at zero is now /(0) = Cf'{ 0) in complete analogy to (11.61) . On the other 
hand, for r > 1 there is no apparent interpretation of 0^T X as a diffusion generator. 

Note that the statement allows for some number eigenvalues of C~ l to vanish. This 
translates into a Dirichlet condition at the origin for the corresponding coordinates of /. 

Theorems [D and [2] are proved in Sections [2] and [3] respectively. The proof of Theorem [D is 
a re-working of the main result of £18|. By virtue of having to deal with matrix rather than 
scalar kernels the proof of Theorem [2] is more elaborate, but the overall strategy remains 
the same. In each case the matrix ( nW n )~ l , with W n = W na or W n ^, is embedded as 
an operator into L 2 [0,1] (of functions taking values in M or F r ). After this embedding, 
the proofs demonstrate a coupling for which ( nW n )“ 1 converges to an integral operator (G 
or G r , defined below in (12.81) or (13.91) . respectively) in strong operator norm. Thus, we 
have convergence of any finite number of eigenvalues and their corresponding eigenvalues (as 
elements of L 2 [0,1]) to the eigenvalues/eigenvectors of these (r dependent) G operators. The 
half-line operators 0 and 0 r are related to the G and G r by an explicit change of variables. 
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Riccati correspondence and PDEs 


The classical Riccati substitution was used in 1.19] to provide a description of the Tracy- 
Widom(/3) laws in terms of the explosion times of a certain diffusion process. Applying the 
same ideas to the spiked soft edge operators, concluded that the space-time generator of 
that diffusion yields a completely analytic characterization of the family of spiked/deformed 
soft edge laws. In brief, the introduction of the variable boundary condition produces the 
previously missing space-like variable in the corresponding PDE(s). These PDEs have sub¬ 
sequently been used to confirm a conjectured Painleve formula for the spiked (3 = 4 Tracy- 
Widorn law form [31] . and more recently has led to Painleve formulas at (3 = 6 [23] (the first 
appearance of such outside of (3 = 1, 2,4). 

Working with the differential forms of © or 0 r yields a similar picture at the spiked hard 
edge. The rank one case follows directly from the Riccati diffusion for the “standard” beta 
hard edge introduced in [IS] . 


Rank one hard edge process Define P Al>c to be the measure on paths q : [/x, oo) t—)• 
[—oo, Too] induced by the (strong) solution of 


dq x 



P 


)Qx 


qI 



dx, 


( 1 . 10 ) 


with initial condition q iL = c and where q is restarted at Too after any explosion to —oo. In 
particular, the exit and entrance points —oo and Too are joined so that the q paths may be 
viewed as continuous for all time. 


The hard-edge(/3) laws introduced in [18] correspond to starting q from the entrance 
barrier c = Too. Note also that the Riccati process there puts the spectral parameter inside 
the drift: the e~ x term in (II. 10)1 is replaced with Xe~ x . Here we are simply making the 
change of variable A = e -/i in order to move this variable into a starting “time.” In any 
case, it is clear that once q hits zero at any finite time x, it immediately becomes and stays 
negative, at which point there is a positive probability of exploding to — oo. The spectral 
counting function is then given by counting the passes to zero of the pieced together process, 
restarted at Too after any explosion. 


Theorem 3. Let 0 < A 0 < Ai < • • • be the limiting spiked hard edge eigenvalues, the inverse 
eigenvalues of 0 = <8p t a tC , for any (3 > 0, a > — 1 and c E (0, Too]. Notate F k (p, c ) = P(A fc > 
e ~ M ), with F = F 0 for simplicity. Then, 


F k (p, c ) = P^ )C (q x vanishes at most k times). 

It follows that F is the unique bounded positive solution of 

OF 2 2 d 2 F .. 2. 2 _ u .dF n 

yb c 8? +(( “b )c ‘ c “ e ) ^ =0 - 


( 1 . 11 ) 


( 1 . 12 ) 


6 





subject to 


lim F(fi, c) = 1 for all c > 0, lim F(/x, c) = 0 for all p < oo. 

fj ,too c4,0 


(1.13) 


The higher order distributions F k solve the same PDE, but with the nested boundary condi¬ 
tions lim 4 _oo F k (/i, c) = F k _ L (/x, +oo) in place of lim 40 F(p , c) = 0. 


As a spiked parameter c is to be taken positive, but q can obviously be started from any 
cel. In addition, when counting k > 1 vanishings it is natural to consider starting points 
c < 0, since subsequent vanishes can only occur after an explosion to — oo. In this way the 
definition of F k is extended to negative c by the right hand side of (11.111) in a consistent way. 
For example, F(/x, c) := 0 for c < 0. 

The PDE characterization of F (not spelled out in [18]) is more or less immediate from 
the passage time description (ll.ll|) . Rumanov [22] has already used (I1.12|) to find formulas 
for F at P = 2,4 in terms of Painleve III. That same paper provides new proofs of the 
(3 = 2,4 spiked soft edge laws using the analog of the PDE (11.121) derived in [3j. It is further 
worth mentioning that more recently Rumanov [23] has found a Painleve II formula at the 
(3 = 6 soft edge (the first known outside (3 = 1,2,4) once more using the spiked/PDE picture 
of [3]. 

For the multi-spiked case, we introduce the following. 


Rank r > 1 hard edge process Define P M ,( C i, C 2 ,...tv) f° be the measure on (non-intersecting) 
paths q : [/x, oo) [—oo, oo] r governed by 


dQi,x ^yjjQi,xdbi T | (a T ^) q, 


Qu 


, „ \ ^ %X + Qj,x 

' Hi,x / J 

T,x ~ 


dx, 


(1.14) 


with initial condition q t/l = c k , i = l,...,r (and convention C\ > C 2 > ■■■). The explo¬ 
sion/restart now pertains to the lowest particle, and q is made continuous on the Weyl 
chamber for all time by following any such restart by a cyclic relabeling of the indices. 


Theorem 4. Now let (3 = 1,2, or 4 and denote by 0 < A r0 < A rl < ••• the inverse 
eigenvalues of & r = & r ,p,a,c a > —1 and C itself possessing eigenvalues ci,...,c r G 

(0, oo] r . Then F k (n, c k , c 2 ,... c r ) = P(A rifc > e -M ) has the representation: 

F k (p, ci,..., c r ) = Pfj/r,{ci,...,c T ) most k zeros among the coordinates of q x ). 

And so, with F = F 0 and ijj(c) = (a + |)c — c 2 — e -r/ b 



subject to the appropriate boundary conditions analogous to (II. 13[) . The solution is unique up 
to permutations of the variables ci, c 2 ,..., c r . The higher order distributions solve the same 
PDE, linked together inductively as in lim Cr .^_ 00 F k (p, ci,..., c r ) = F k _ i(/x, + 00 , ci,..., c r _i). 
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As indicated above, the value a = 0 is critical for the r = 1 operator in terms of 
the interpretation of the boundary condition at infinity. For a G (—1,0) the condition is 
Neumann, and for a > 0 one can view the condition as Dirichlet (the diffusion generated by 
0 _1 has a natural boundary condition there). In the Riccati picture, the difference between 
Neumann and Dirichlet is whether eigenvalues are counted by passages to 0 or to — oo. One 
might expect the same dichotomy at a = 0 in the case r > 1. What we can show is the 
following. 

Corollary 5. In either Theorem 0 or Theorem [|J if a>r — 1 we can replace the character¬ 
ization of the k-th eigenvalue distribution function in terms of the probability of (at most) k 
zeros with that of (at most) k passages to — oo. 

Theorems [3] and |4] along with the corollary are proved in Section |4j 

The hard-to-soft transition 

In EH Hj the outcome of the Riccati map is that the one-spiked soft edge distributions are 
described in terms of the probability that the process 

dp x = -jpdb x + (x - p 2 x )dx, (1.16) 

begun at w, has a given number of explosions to —oo (after subsequent restarts at +oo). 
In particular, with A 0 (/3,w) < A 2 (/3,w) < • • • the eigenvalues of the “deformed” Stochastic 
Airy Operator, in j3] it is proved that 

P(A k{(3,w) < A) = Pa ,w{p x explodes at most k times). 

Here Pa, u , indicates that p is started from the time/space point (A, w), as should be compared 
with Theorem El Again recall that — A 0 (/5,+oo) has the Tracy-Widom(/3) distribution pH) . 
Likewise, for r > 1, the process (11.1611 is replaced by the joint diffusion p = (pi,... ,p r ): 

dpi, x = -j^db i x + l rx- p- + V --- J dx. (1.17) 

And with now {A r ,k(/3, w)}, w = (w±,... ,w r ) the eigenvalues of the operator (11.ip . among 
the results of [4] is the fact that: for (3 = 1,2,4, 

P(A r ,fc(/9, w) < A) = Pa/,-, (iui,..., Wr .)(p explodes at most k times). 

Here again, the process at started from pi = Wi, i = 1,... r (with the convention that 
w r < ■ ■ ■ < w i) at the time-like parameter x = A/r. Compare Theorem |4j 







Back in [18] we proved that, for all (3, the rescaled (non-spiked) minimal hard-edge 
eigenvalue converges in distribution to the general Tracy-Widom law as a —> oo. This was 
accomplished by showing that the law of passage time to — oo for a suitably rescaled hard- 
edge process (11.101) . begun at +oo, converges to that for the soft-edge process (11.16j) . also 
begun at +oo. Previously, this “hard-to-soft transition” had been verified for (3 = 1,2,4 
using the explicit Bessel/Airy correlation functions [5]. In all cases, the intuition is that a 
is a dimension-difference parameter in the classical Wishart ensemble. Exchanging limits 
freely, a —> oo would correspond to an asymptotically rectangular Wishart, for which the 
smallest eigenvalue pulls away from the origin and has “regular” Tracy-Widom fluctuations. 

Given the above, it should be no surprise that we can re-capture all the various spiked 
soft edge laws by rescaling our spiked hard-edge laws. Here we prove the convergence in the 
sense of finite dimensional distributions of the hard and soft edge point processes (rather 
than just for the first point as was done in |[I8]). 


Theorem 6. Again denote by A 0 , Ai,... the (potentially one-spiked) hard edge eigenvalues. 
For a —» oo and c = c(a) —>■ oo with lirn^oo a~ 2 / 3 (c(a) — a) = w E (—oo,oo], there is the 
convergence in law, 


a 2 - Aj(/3,2a,c(a)) 


7,4/3 


{Aj(/3, ic) 


i=0,...,k 


for any finite k. Note the shift a (->■ 2a in the “dimension discrepancy” parameter in the Aj. 

The same appraisal holds for the r-spiked laws. Any finite collection of points from 
the family {a 2 / 3 — a _4 / 3 A ri j(/3, 2a, c(a))} converges in law to the corresponding {X rk (/3, w)} ; 
assuming (c(a))j = cfia) is such that lim^oo a _2 / 3 (cj(a) — a) = Wi E (—oo,oo], w = 
(wi, ...,w r ). 


Remark 7. Taking the point of view that {A rj j} and {A rj j} are defined via the diffusions (11.141) 
and (11.171) . that is, without reference to the limit operators, the hard-to-soft convergence in 
the r-spiked case takes place for all (3 > 1. 

The proof of Theorem [6] makes up Section [5] 


Supercritical scaling 

It is now well understood that, at the soft edge, if the spikes are too large (or “supercritical”) 
the fluctuations of A max are effectively Gaussian. More precisely, for r = 1 there is a normal 
limit law for the largest eigenvalue, while for r > 1 the rescaled largest eigenvalue converges 
to that for an r x r copy of the corresponding invariant Gaussian ensemble. For (3 — 2 
this phenomena is included in the results of [2]. It can also be derived by a perturbation 
argument in the deformed soft-edge operators. 
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While there is no critical point for the hard edge, it is clear that the analog of supercritical 
spiking is, at least for r = 1, to consider the c —> 0 limit of the operator C5/3, a ,c after the fact. 
In the multi-spiked case we take the simplest version of this, setting C = cl with c —)■ 0. For 
r = 1 we find: 


Corollary 8. As c —> 0, 0 (/3,a,c) ^Xp( a +i) distribution. 


Of course the a —* oo scaling limit of a Xp( a +i) Gaussian, recovering the soft-edge 
supercritical limit law. The proof is standard perturbation theory: as c —> 0, plainly c 6 , 5 AC 
tends (with probability one) to the rank one operator defined by integration against m(<ir). 
That is, the ground state is the constant function with corresponding eigenvalue J 0 °° m(dx), 
and the only question then is to understand that law. But an identity due to Dufresne [9] 
shows that 


/ e^-^dx ~ —, 

J o 

for a gamma random variable of the indicated parameter, which is exactly what is needed. 
For r > 1 an identical perturbation argument gives: as c —> 0, 


c l A rfi (/3,a,cI) ^ X r 


M. r dx 


-i 


(1.18) 


with A4 X is defined in (11.81) . Recognizing j}Xp( a +i) as a 1 x 1 Wishart law prompts a conjec¬ 
tured matrix Dufresne identity: spec(J 0 °° Ai x dx) should have joint law corresponding to the 
r x r inverse /3 = 1, 2, or 4 Wishart distribution. This has subsequently been proved [21], 
and we can state the following. 


Corollary 9. (See Corollary 10 of l2T( ) As c —> 0, c _1 A rj o(/3, a, cl r ) converges in distribution 
to the law of the minimal eigenvalue of the (r, r + a) real, complex, or quaternion Wishart 
ensemble. 


For completeness, in Section [ 6 ] we include a proof of Corollary [9] for (d = 2 and integer a 
using the Fredholm determinant approach of [2]. 


2 Single-spike operator limit 


After a conjugation by the diagonal matrix of alternating signs, the matrix model has the 
form nW n , a = nB a Bf in which the upper-bidigaonal B a has entries 


Xf3(n+a) X/3(n— 1+a) 

VP ' 


and — 


X/3(n— 1) Xf3{n— 2) 


VP 


VP 
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along the main, and first off-diagonal, respectively^] Keep in mind that, despite possibly 
repeated indices, all variables are independent. 

Next, any matrix A = dij G M. nxn can be embedded into L 2 [ 0,1] without changing the 
spectrum by setting: for Xj = i/n for i — 0,1,..., n and / G L 2 [ 0,1], 

TL 

W)(*) := £ a i,j n / /(x)dx, when x ,_\ < x < x*. 

7=1 J Xj -1 


Then, by the inversion formula for bidiagonal matrices, as an integral operator, 
( y/nB a )~ 1 (x,y ) = fl x>a:i (x) + l x < Xl (x)-^=^ k n (x,y), 
with the discrete (upper triangular) kernel 


( 2 , 1 ) 


mv)=^- n 

Xp{n+a-i) k=i+1 X/3{n+a-k) 


lr 


ij 5 


( 2 , 2 ) 


in which 

Ty = {0 < x < y < 1 : x G (x^x*], y G (%~i, Vj]}- (2.3) 

Note that on diagonal (if i = j) the product in (12.2ft is understood to equal one. 

Putting (12.lj) together with its transpose we can write out the action of (nW nj(T ) -1 on an 
/GL 2 [0,1]: 


1 /•! 


(wW^) /(x) = 



' x\ J 0 
-T 


k n (y,x)k n (y,z)f{z)dzdy H-fc„(0,x) / k n (0,y)f(y)dy (2.4) 

na ,/n 


:= K 2 K n f(x) + - ( - - 1 ) fc n (0,x)(fc n (0, •),/>• 
n la 


Here A" n is the integral operator with kernel k n (defined in (j2.2[) . fj2.3[l ) and (•, •) is just the 
L 2 inner product. 

The point is that (12.4|) represents (nlh n ^) _1 as a rank one perturbation of K^K n . As the 
latter is exactly the non-spiked model (just set a — 1), the relevant convergence/compactness 
properties of these operators have already been studied in [T8]. Everything we need to take 
norm limits throughout the right hand side of (12.41) is summarized in the following. 

Proposition 10. (Lemma 6 of fTBJ) Let K^ a be the integral operator on L 2 [ 0, 1] with kernel 

" y dh 


1 + a 


kp, a (x,y):={ 1-x) 2 exp 


VW ~ z ) 


(l-y) a/2 l 


x<y 


(2.5) 


2 In !_f8] we also flipped the indexing by conjugating as in B >->• SBS 1 with Sij = (—iySi+j- n -i. With 
the benefit of hindsight it is more convenient not to make this extra move. 
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Here z ha b z is a standard Brownian motion. Then, for any sequence of the operators K n , 
n —> oo, there is a subsequence n' —> oo and suitable probability space on which K n / —> K^ a 
almost surely in Hilbert-Schmidt norm. 


Proof of Theorem 0 Since it serves as a model for the multi-spike case, we first sketch the 
main steps behind the above proposition. Here we drop the (/3, a) subscripts on k and K. 

It is straightforward to show that there is the convergence 

y/fri 1 ' . 


X/3(n+a—i) 


X/3(n—k) 

\/l — x k=i+l X-P{n+a-k) 


n 


y 


X 


rv ab z 
e J * y/P(l-z) 


( 2 . 6 ) 


as processes. So k n (x,y ) —» k(x,y ) pointwise on [0, l] 2 . With a bit more effort it is shown 
in [T8] that there is the estimate 

K(x,y) < ' e c ^ + ^l x<y (2.7) 

\/l — x — x/ 

( 1 \ 1/2+ 

:= K Cn (x, y) with 0(z) = f 1 + log - 1 

for C n a tight random constant |§ This comes down to a quantitative version of the law of 
the iterated logarithm for the random sum log X/3(n ~ fc) , And then by the law of 

<=, X/ 3 (n+a-k ) J 

the iterated logarithm for Brownian motion, we also have that the limit kernel k satisfies 
the same bound, but with C n replaced by a fixed (be., independent of n ) random constant 
C < oo 

Now we can choose a subsequence n' — > oo over which C n > converges, and then realize 
this convergence as almost sure convergence on some probability space. With this set¬ 
up we have both k n >(x,y ) < Kc(x,y) and k(x,y ) < K c (x,y) after possibly adjusting C. 
But Jj 0 Kc(x, y)dxdy < oo, and this supplies the domination needed to conclude that 
\\k n ~ ^||l 2 [o,i] 2 ~t 0. 

The new object appearing in the spiked operator (12.4ft is the projection onto the function 
k n (0,x). But the estimates just employed show this also converges almost surely (in the 
same sub-sequential coupling) to the projection onto k( 0,x). Hence, if an -A c, we have that 
(n'Wn^tr) -1 converges in norm to G defined by 


Gf(x) =(K T Kf)(x) + - k(0,x)(k(0, ■),/) 


= / (1 - y- /2 e 


( 2 . 8 ) 


1 

-( 

c 


a/2 So 

db t / 

' nr- dh t 

rx/\y e Jo ^ p(1 - t ) 

s/PCt-t) 


Jo (1 - -) a+1 

y/2 e fc 

x dbt 

r°° , rv 

1 vw-d 

/ (i — y) a 2 r Jo 


dz I (1 ~y) a/2 e fo VK=*f(y)dy 


'0 


3 flSl reports an exponent of 3/4 in the bounding function rf>, but the proof there shows that this can be 
replaced by anything greater than 1/2. 
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yet again on the chosen probability space. Norm convergence implies that we have conver¬ 
gence of finite parts of the spectrum (this is standard, but Theorem 1 of [T8] includes a proof 
in the present context). Finally, since any such extracted subsequence produces the same 
norm limit, we have that any finite collection of eigenvalues and eigenvectors of nW Uj(T (the 
latter as elements of L 2 [ 0,1]) converge jointly in law over the full sequence n —>• oo. 

Just to indicate the map from G to 0, one checks by Brownian time change that 


4(0,1 - ■ ! m{dx) 


dx 

where the equalities are in law and b x is a new Brownian motion. After the same change of 
variables, the kernel of K T K satisfies 


(k T k){ 1 - e~ x , 1 - e- y )e~ x/2 e- y/2 





m (dx) 
dx 



Together this means that G acting on functions / G L 2 [0,1] possesses the same eigenvalues 
as 0 acting functions on f(x) = /(I — e~ x )e lX+ ^P bx G L 2 [[ 0, oo),m]. □ 


3 Multi-spiked operator limit 

The starting point is the block bidiagonalization procedure of Bloemendal-Virag, introduced 
in [J] to analyze multiple spikes at the soft edge. Let S = S r © I n _ r with £ r an r x r 
positive definite matrix, and consider XE 1 / 2 with X an n x (n + a) Gaussian matrix with 
independent unit real, complex, or quaternion entries. The key ingredient from [4j is the 
existence of U, V with UU^ = VV^ = / so that UXV is “upper block bidiagonal” with r x r 
blocks. Denoting by Dk, k = 1,... |~n/r~| and Ok, k = 1,..., \n/r] — 1 the diagonal and 
immediately above diagonal blocks, it holds that 

{ y^X/3(n+a-r(fc-l)-i+l) if * = j, 

9ij if j>i, (3- 1 ) 

0 otherwise, 

and 

rk— i+1) if * 3i 

9ij if j < i, (3-2) 

0 otherwise. 

The x' s are again y random variables of the indicated parameter, but now the p’s are F- 
valued unit Gaussians (which prevents a natural extension to general beta). As usual, despite 
repeated notation, all random variables in sight are independent. Note also that the final 
blocks may need to be truncated in the obvious way (whenever r fails to divide n). 
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What is important is that the outcome of applying the above procedure to XE 1 / 2 rather 

1/2 

than to just X is simply to replace D\ with D\H r . This defines a block bidiagonal random 
matrix B = 1/2 such that W n ^ = BB t has the same eigenvalues as XEXf 


3.1 Identifying the limit 

The program follows the r — 1. The scaled matrix (y/nB-% 1 / 2 ) _1 is imbedded into the 
appropriate L 2 space. The block structure dictates that this should be the vector valued 
functions /: [0,1] 1 —» F r with square-norm ||/|| 2 = Jj {) ^ f(x)^f(x)dx < 00 . 

One can again use the (block) bi-diagonal structure to compute the inverse explicitly. 
In conjunction with a conjugation by a matrix of alternating signed r x r identity matrices 
(which has the effect of replacing each Ok with —Ok), the action of (^/nB^ i/ 2 ) _1 can be 
identified with that of the discrete matrix kernel: 

k n , r {x,y ) = ~ D i l ( n OkD^ 1 J 1 i<j, for x G y G [yj-i,yj). (3.3) 

\k=i+l / 

Here Xi — —, yj = ^ and j < [n/r]E The point being that the discretization scale is now 
A = j-, which accounts for the 1 prefactor on (13.31) . This object left multiplies functions 
f(y) E F r (prior to integration in the y variable). Compare the r = 1 kernel (I2.2[) . 

Next, setting 

[nx / r] 

d n (x) = Ofc-Dfc+n (3-4) 

k =1 

we can write the kernel (13.31) in a streamlined fashion, 


kn,rijE,y) ^ l^[2i 3 ,]d. rl (x) d n (|/)lo< 3 ;<y<l, (3-5) 

granted some ambiguity in the indexing. This matrix process x 1 —> A n (x) satisfies the discrete 
equation 

A n (x + A) - A n (x) = A n (x) (o [nx/r]+ 1 D^ n l /r]+2 - /) , (3.6) 

with the convention that H n (0) = /, the r x r identity. Estimating moments of the entries 
of the (OD~ l — I) multiplier, it is not difficult to convince oneself that x 1 —y A n (x),x G [0,1] 
converges as a process to x t —> A x defined as the (unique) solution of the Ito equation: 


cL4 r = 




-A x dB x + 


x 


+ 


2r(l — x) 2/3r(l — x) 


A x dx , H 0 = I. 


(3.7) 


Here B x is an r x r matrix Brownian motion introduced above. Each diagonal is a real 
Brownian motion with diffusion coefficient each off-diagonal is a unit F-valued Brownian 

throughout [•] denotes the (appropriate) integer part. 
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motions, and all coordinate processes are independent. This structure is simply a conse¬ 
quence of the blocks (£)*,, Ok) having (real) y-variables along the diagonal and F-valued 
Gaussians otherwise. 

Moving along, it is easier still to prove the process level convergence => \/l — xl. 

The (preliminary) conclusion would be that k n:T converges in law to 

k r (x, y) = / A x A l 0 < x<y <i, (3-8) 

rv 1 — x 

in the uniform-on-compacts topology of [0,1) x [0,1). Hence, granting the existence of a 
positive definite C = lim, woo nT, r , the candidate operator limit of (nWo)~ l is: 


G r f(x) 




1 


-A-M-s) 


Ayf{y)d-dy + -A'/' 1 / A y f(y)dy, 


(3.9) 


again acting on / E L 2 ({ 0,1] t->- F r ). This is the analog of (12. 8 j) . 

Finally, to go from G r to the representation of (5 r in the statement of Theorem [21 is 
through a similar change of variables employed at r = 1: now 1 — x K > e~ rx . In particular, 
defining A x = A x _ e -rx, x E [0, oo) we have that 


dAx 


A x dB x T ( ^ T 2 / g)'^ x ^ x ’ 


(3.10) 


with a new matrix F-Brownia.n motion B x , compare (11.71) . Then, similar to the one¬ 
dimensional case, after this substitution one may symmetrize with respect to L 2 [M]. Recall 
that A4 = e~ rx A x A\. and f E L 2 [M] if / 0 °°(ft Aif)(x)dx < oo. In particular, view the 
transformed (j3.9j) as acting of functions f of the form A~^f( 1 — e~ rx )e rx ^ which reside in 
L 2 [M] by construction. This sends eigenvectors / of G r into eigenvectors f of & r with the 
corresponding eigenvalues unchanged. 


3.2 Outline of the proof 

The convergence of the spectrum requires some form of norm convergence of the operators 
K n>r to K r (these denoting the integral operators associated with the kernels k n ^ r and k r of 
(13.5|) and (13.81) ). As in the proof of Theorem |Tj (recall the argument surrounding Proposition 
[TUP things follow readily given that we can show: 

Theorem 11. The exists a probability space on which all k n>r and k r are defined and, given 
any sequence n f oo, a subsequence n' f oo such that 

|| k n ' :r (x,y) - k r (x,y)\\ 2 dxdy -E 0, (3.11) 

with probability one. Here || • || is the (matrix) operator norm. 
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Step 1 is to prove (Lemma fl2lof Section l3T3j) that the limit kernel k r (x, y) = —h=A X 1 A y l x < y 
is almost surely Hilbert-Schmidt (showing that 0 r has the properties claimed in Theorem 

0 . 

Step 2 constructs a (random) dominating kernel, on a restricted domain. We show that, 
for large enough c there is a kernel k rjC (x,y) such that: with a tight random sequence C n , 

lo£f Tl 

k n ,r(x, y) < C n k r c (x, y) for 0 < x < y < 1 - c-, (3-12) 

n 

where k T}C (x, y ) satisfies 

\\k r , c {x,y)\\ 2 dxdy < oo. (3.13) 

See Corollary [16] of Section 13.51 This step also relies heavily on Lemma [14] of Section 13.41 
which represents the product A n (x) as a fairly explicit (though approximate) Euler scheme 
for the limiting x H > A x . Along the way we conclude that k ntr (x,y) —$■ k r (x,y) pointwise. 

Step 3 uses Step 2 as input to show the rest of the range of integration can be neglected: 

\\k n:r (x,y)\\ 2 dydx -»■ 0 (3.14) 

(X xCjy < 1. y>l—c lo ^ n 

in probability (Lemma [TH] of Section IH1)D . 

From here the proof follows the course of the r = 1 case. The only real wrinkle is the 
requirement of the cutoff (off the last O(logn) discrete time steps) in the domination (13.121) 
(compare this to the bound (j2.7[) used at r = 1). Tracking the growth of the matrix kernels 
is just more involved (we also make due with a suboptimal bounding kernel k r c in terms of 
its behavior near x,y — 1, again compare (12.71) 1. 

Anyway, one is now free to choose a subsequence and a probability space on which (13.141) 
takes place almost surely and the bound (13. 12)1 holds almost surely with the tight sequence 
C n replaced by a deterministic c (which bounds the chosen sub-sequencial limit of C n ). 
Together with Step 1 this provides the necessary domination for the conclusion of Theorem 
dU (We have already mentioned we will have that k n r —> k r pointwise, and this can also be 
realized almost surely by choice of a further subsequence.) 

3.3 The limit operator 

As for r — 1, the limit kernel k r is Hilbert-Schmidt (and so 0 r is compact). The advantage 
of the continuum setting is that one can readily derive an sde for the singular values of 
the family (x,y) i —> A^Ay. The analysis is similar to that for the Lyapunov exponents for 
Brownian motion on the general linear group, see for example ra- 
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Lemma 12. With probability one, 


1 /•! 



r*oo poo 


0 J x 


1 — x 


|| A x 1 A y \\ 2 dydx = r 2 


e 1 \A S 1 A t \\ 2 dtds < oo, 


(3.15) 


recall |X7|) and Kd.ltt) . Again || ■ || is the matrix operator norm. 


Proof. As all norms are equivalent we can reduce this to an estimate on the largest singular 
value of the two-parameter family Af l At . Thus, introducing the processes t (->• U(t',s) — 
A} t Af J[ Af 1 At subject to U(s\s ) = /, let 0 < A r < • • • < Ai denote the corresponding 
eigenvalues. A computation similar to that performed in the proof of Theorem [4] below 
shows that these points perform the joint diffusion process: 


2 2 y. _|_ y . 

dXi = —=Xidbi + (—a + —)A idt + \ ^ ^ —- ^-dt. 

V P P At — Aj 


(3.16) 


For the stated almost sure conclusion, (13.161) is likewise viewed as a two-parameter family 
of processes A* = A i(t;s) for t > s, all coupled through being run on the same Brownian 
motions bi,... ,b r . 

Passing to logarithms y* = log A*, the upper and lower paths can bounded as in: 


and 


7i(*;s) > ~bi(s)\ + (-a + r- l)(t-s), 


7 r(t\s) < —J=\b r (t) - 6r(s)| + (-a - r + l)(t - s). 

Ai +A ? - 


(3.17) 


(3.18) 


These estimates follow upon substituting > r — 1 or ^-a' — — ' r + 1> 

respectively, into (13.161) . Next, since 

A k + A,- A fc+ i + Xj ^ 0 X k + A fc+ i 


A k — Xj J 

j^k J jy^k+1 


k Aj ., , Afe-)-i Xj 


<2 


Afc Afc+i 

we have that y*, — y^+i is path-wise bounded by the solution of 

dz t = db t + 2 dt + 4(e zt — 1) _1 dt, 


again started from t = s. Here b t = -2=(& fc (t) — b k+ i(t)). Since -T_ > the process z t 
is almost surely positive. It is also bounded below by b t + 2 1, and so there is an almost 
surely finite time r after which z t > t. These observations imply that J 0 °° e jf t _ 1 is finite with 
probability one. Hence, 

7 k - 7fc+i < - b k (s )I + ~^=\b k +i(t) - b k+1 (s)\ + 2 (t - s) + C, (3.19) 
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with an almost surely finite C = C(bk,bk + 1 ). 

Putting together (13. 171) and (13.191) . along with the law of the iterated logarithm, we find 
that 

r —1 

7 r(t] s ) < 7i(f; s) + y^(7 fc+ i(f; s) - 7 fe (f; s)) < (-a + r - l)(f - s) + C(1 + (f - s)^ + ) 

fc=i 

with another almost surely finite constant C = C(bi ,..., b r ). The conclusion is that the 
integral f™ e^ rt \\Aj 1 At\\ 2 dtds converges along with / 0 °° / s °° □ 

For later we extract the following from the proof of Lemma fl2l 

Corollary 13. There is a finite random constant C such that, for s <t, 

I \Aj 1 A\\ < Ce^ h-s)+c(ts)h +^ (3.20) 


holds almost surely. Back in the original coordinates this implies that, also almost surely and 
with 0 < x < y < 1, 


WA^AyWKC 1 


y 


X 


0 + 1 


(3.21) 


fore > 0 any chosen (small, deterministic) constant and (a finite, random) C' = C'(C,e). 


3.4 Rewriting the matrix product 


A bit of convenient preprocessing casts the matrix product A n {x), see again (13.41) and (13.611 . 
as an approximate Euler scheme for the claimed limiting sde (13.71) . 


Lemma 14. With A = - there is the representation 


A n (x + A) - A n (x) = A n (x) 




-a + 


I + £\, 


(3.22) 


Here Gu = G n>r is an independent sequence of approximate F -Gaussian matrices. All off- 
diagonal elements are independent unit W-Gaussians, while the diagonals are sums of inde¬ 
pendent x variables: 


( Gk)ii — (Dk)u + (~ E[(Dk)u + (Ofc)jj], (3.23) 

recall the definitions (13.1|) and (13.21) . The error matrices 8k = £ n ,k form another independent 
sequence. Each has a deterministic plus noise decomposition £k = + £) satisfying: 

Kll<7 -E7 = 0, E||7|| 2 '< 7 -(3.24) 

[n — rk) 6 ' z (n — rk) zp 

for integer p > 1 with constants c = c(/3, a, r, d , p) granted that n — rk > d 1. 


18 



















One first byproduct of the above (coupled with say [25, Theorem 11.2.3]) is the advertised 
convergence of A n (x) to A x . The representation (I3.22p will also be key for the norm control 
of the associated kernel. 


Corollary 15. The Markov chain A n (x ) converges pathwise (in the usual Skorohod topology 
of uniform on compacts in [0,1)^) to the diffusion A x defined by (13.71) . 


Proof of Lemma If. We have to show that 


O k D£ ! = I + 


--G + 


a — 


y/n - rk 2 (n - rk) 


I + £, 


(3.25) 


with random matrices G and £ having the stated properties. For convenience we are shifting 
the index on the OD^ 1 product to k from (fc + 1). 

Denote by F^ = F H) k,b the r x r diagonal matrix with nonzero entries -^Ex^n+b-rk-i+i), 
i = 1, 2,..., r. Then, recalling the definitions of the D and O blocks (j3.ip ~ fl3.2p . we write 


Ofc-Dfc+i := {Ff.fi + H)(F k:a — if) -1 , 


(3.26) 


making two more definitions. H is upper triangular with independent F-Gaussians off diag¬ 
onal and centered negative x variables (pairing with Fk )a entry-wise) on diagonal, and F[ is 
lower triangular, again with independent F-Gaussians off diagonal and centered x variables 
(pairing now with on diagonal. Extracting 

G := H + H 


is the guiding principle. 

By the resolvent identity we have that, 

(K - H r 1 = F- 1 + FpHFp + F-'(HFp) 2 + (F„ - Hy^HFp) 3 , (3.27) 

where we drop the k indices from here on. Substituting the polynomial approximation 
inherent in (I3.27j) into (I3.26j) gives 

(F„ + H)(F a - H)-\l - (HFpf) (3.28) 

= FoFp + HFp + F„FpHFp + F a FpE(HFpf 
+ HFpHFp + HF~\HFpf + F a Fp((HFpf - E (HFpf). 

The first line on the right hand side of (13.281) contains the leading order in (13.251) . while the 
second line is mean zero (H and FI are independent with mean zero entries) and will be 
absorbed into £. 
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To deal with the first line we need the estimates 


1 


W - (i 


-)i\\ = o 


\F r 


-l 


2 {n — k) J 11 ~ \{n — rk) 3 / 2 ) ’ 

1 i\\ = o f 1 


(3.29) 


Vn — rk~ " \n-rk ) ’ 

in the first three terms, and, given that E(iJi r ” 1 ) 2 = X“ 2 Ei/ 2 , 

||EiJ 2 - J-/|| = O 


2/3 


n — rk ) ’ 


(3.30) 


in the last term. Of course, the same estimates would hold for H replacing H in the last 
display. All of this comes down to the simple appraisals 

E Xt = Vi-^ + o(-^J , VaiXt = ^ + o(jj , (3.31) 

for t large enough. In summary, 


F 0 F-' + HF- 1 + F„F-'HF-' + F 0 Ff'E(HF-') 2 

1 , (—o + l//3) 


(3.32) 


= I 


\Jn — rk 


(.H + H) + 


2 (n — rk) 


X + S’, 


with diagonal I = / + 0((n — r/c) -1 / 2 ) and a mean-zero S' satisfying the conditions laid 
out for the Si terms. For instance, S' contains terms like H(F~ l — -j==I) with expected 
norm powers controlled by the second line of (13.29j) and the fact that the entries of H and 
F[ have bounded moments of all orders. For the latter the estimate (I3.3ip is supplemented 
with E |Xt ~ Fxt\ 2p < ( cp ) p , for t > d and c = c(c'). 

Returning to the second line on the right hand side of (13.28j) . we have already remarked 
that all terms have mean zero and the expected norm estimates follow similarly to those we 
just described. Now use || Xb-Xj^y 1 1| < c and 11 1 11 < ~^== along with the moment control 
on the entries of H and H. 

Finally consider the remainder from (j3.27j) and (I3.28|) . or 


S := (F„ + H)(F a - H)-\HF-') 3 - 


We will show that: as always in the regime of large n — rk , 


[F 0 E(F„ - ffJ-'fffF- 1 ) 3 ] = O 


4 




nj \y (?t. — r*A:) 3 /2 

that is, it is diagonal with diagonal entires bounded in kind, and that 


E|| H{F a -H)- l {HF~ l Y\\ 2 v = 0 


[n — rk) 4 P 


(3.33) 


(3.34) 





























With these established, F 0 E(F a — H)~ l (HF~ 1 ) 3 can be added to £° and the mean-zero 
matrices inside the norms in (13.34ft can be absorbed into to £ 1 and the proof will be complete. 

Starting with (j3.33j) . that E[(F a — H)~ l (H F^ 1 ) 3 } is diagonal follows from the Cayley- 
Hamilton theorem along with the fact that EH P is diagonal for any integer power. To see 
that latter, expand out ( H p )ij = H, 0ll H nt2 ■ ■ ■ Hi ^ , with i 0 = i, i p — j and note that unless 
i = j there is at least one non-repeated (mean-zero) H iki . This is simply because H is 
upper triangular and so i k < i k +\ along the expansion. The estimate itself can also be seen 
term-wise, by say writing out (F a — H ) _1 using Cramer’s rule. The bounds in (I3.34[) are 
similar. After an application of Cauchy-Schwarz both bounds come down to the inequality 
E||(F a — H)- 1 || 4p < (n _ c rk]2p which yet again follows from expanding things entry-wise and 
using E\Hij\ q = 0(1) for all i,j and q. □ 


3.5 Tightness up to good times 

We prove that for any e > 0, there is a c 1 such that 

||A n (x)|| 2 < O n (l - x)~ 1+ ^~ £ , for x < 1 — c^-^, (3.35) 

n 

where C n = C n (e) is a tight sequence of random variables. Considering partial sums we will 
then conclude that 


Corollary 16. For e > 0 it holds 

\\A n (x)~ l A n {y)\\<C n (\—for 0 < x < y < 1 - (3.36) 

\ 1 — x J n 

with a deterministic c = c(e) and a tight random sequence C n = C n (e). 


This should be compared to the estimate in Corollary [13 for the continuum case, and 
finishes Step 2 of the proof of Theorem (TU Note that a tight random upper bound of the 
form \/nDf^ x / r \ < C^(l — x)~ 1,/2 / poses no difficulty, and also that the righthand side of 
(I3.36P is in T 2 [{0 < x < y < 1}, jh^dx x dy] as long as e < 

As for (I3.35h . the proof is inspired by p6]. From Lemma fl4l we can express A n (x) = 

n£ /r] ^ with 


j. (~a + 4) 

Z k = I + .- - G k + -- P—I + S k , (3.37) 

Vn - rk 2 (n - rk) 

where the G k are independent approximate F-Gaussian matrices and the error matrices £ k 
possess the decomposition £ k = £ k + £l in which £ k are (deterministic) diagonal matrices 
satisfying ||£°|| < c(n — rk)~ 3 ^ 2 and E£\ = 0,E||T^|| 2p < c(n — rk)~ 2p , for n — rk > c'(c), 


p > 1. 
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(3.38) 


Toward controlling the norm of A n (x), choose a fixed v G S r X (F) and consider 


[nx / r] [nx / r] [nx / r] 

v j A n (x)A n ( X yv = JJ Z k )( JJ Z k ) f v = JJ v f k Z k z£v k , 

k =1 k=l k =1 


where the introduced vectors v k G S r *(F) are defined by 

(Z*-i4-i)( z *-i z L 2 ) ■ ■ ■ (Zi Z\)v 


Vk = 


for k > 1, v\ = v. 


ii(^_ 1 zL 1 )(z*- 1 d- 2 )---(ZidHi 1/2 

In particular, each v k is T k ~\ measurable. With this set-up, (I3.35P will follow from: 

Proposition 17. Set 


[nx / r] 


Qx = Q x (v) = lo § [ v i z k z i 


k =1 


' k V k I - 


(r — a - 1 + |e) 
n — rk 


(3.39) 


the centered logarithm of (I3.38j) . and h{x ) = (1 — log(l — x)) l / 2+ . Then, with choice of c 
depending on e > 0, it holds that 


0 < sup h(x)Q x < C n 

x<1 - c l£EIL 

— n 

for C n a tight sequence (dependent on c but independent of v). 


(3.40) 


Indeed, just write: for x in the prescribed range and U£, £ — 1... ,r the standard basis 
vectors, 


ti(A n (x) ] A n (x)) = e 


E [nx/r]l r—a — l+e /2 
k =1 \ n—rk 


£=1 


Qx(up) 


< 2 lo §pTry) 1/2+ ) 

<C' n { l-x)~ 1+ ^-S 

The first inequality uses (I3.40j) in the second factor and a rough upper bound in the first (valid 
for large enough n). The second inequality just introduces a new tight random sequence C' n , 
based on C n , e, and r. 


Proof. First, since we only require a tight random constant bound, we are free to restrict 
to an event h2 n with probability tending to one as n —y oo. Let be the event on which 
all the F-Gaussians appearing in the collection of Z k matrices k < 1 — are bounded in 
norm by c! log n, and all the x random variables are within c' log n of their mean. The claim 
is that by choice of c' the probabilities P(f2 n ) are in fact summable, and on Q n it holds that 


w z k z i v ~ !i < 


d'y/logn 
yjn — k ’ 


(3.41) 
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where c" depends on c! only. In the next subsection (“Throwing out the tail”) we prove a 
version of this claim in a more difficult setting — see in particular (13.541 and its subsequent 
proof — and so do not repeat the argument here. 

By taking c sufficiently large the inequality f!3.4ip can be taken to read \v^Z k Z' k v — 1| < 5 
with a choice of e > 0 for all k < n — clogn. This leads to the bound: on Q n and for 


[nx/r] [nx/r] , [nx/r] 

£ log (4z k z' k v k ) < £ (vlz t z' t v h - 1) - £-4 £ (»*Z t ZK - l) 2 , (3.42) 

k= 1 k=l k =1 

as log(l + x) < x — |(1 — 5)x 2 for x < 5. 

Next denote 

Z t Zl = / + X„ + Y k + g / + £l (3.43) 

for 

W = =4=(G* + G»), n = -4—G^Gt (3.44) 

Vn - rk n — rfc 

and 

S' k = (S k + £*) + + etcetera, (3.45) 

yet another error term. Also introduce the shorthand: X k {y) = v'X k v, Y k (v) = v'Y k v, and 
so on for any v 6 S r_1 . 

By choosing S — |e the inequality (13.42(1 implies that 


[nx / r] 


( AA(ufc) + (T'fc(t’fc) - 


fc=i 


( r ~ 1 ~ ?) 

(n — r/c) 


) + (!-*)( 


/ 3(n — rk ) 2 


-X fc (u fc ) 2 ) (3.46) 


[ni/r] 


X] (4M + + 2(1 - S)X k (v k )W k (v k )), 


k =1 


in which ITfc(w) = Y^n) + + £fc(w). Intuitively, line one of ((3.46(1 produces the leading 

order with the terms in line two having greater built-in decay. 

As for that first line, we consider each term separately. That is, we demonstrate tight 
random constant bounds on: 


sup h(x) 

x<1 - c l2ML 

— n 


[nx / r\ 

y ^ x k {vk) 1 1 

k =1 


(3.47) 
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and 


sup h(x) 

a ,<l_ c logn 
— n 


sup h(x) 

— n 


[nx / r] 

E uk) 

k=l 
[nx/r] 


E 


—' j3(n — rk) 


(n — rk) 


x k (y k f 


(3.48) 


Of these, the near Gaussian sum in (I3.47P is the most delicate, requiring the type of quanti¬ 
tative iterated log estimate carried out in the end of Section 2 of [TS]. The Y k and X k sums 
can be dealt with more crudely as follows. 

Using (13.3ip one easily finds that 


E Y k (v) = 


1-4 


{n — rk) 


+ 0 


(n — rk) 2 


EAT (nr = 


/3(n — rk) 


+ 0 


(n — rk) 2 


uniformly in v G S r 1 . And since cl ° sn )/ r l ^ n 2 rk yi = O j, the sums in (13.48P can 
be replaced by Y^k=i^ Vk(v k ) with 


Vk(vk) = Y k (v k ) - EYl(v k ) or rj k (v k ) = EX k (v k ) 2 - X k (v k ) 2 , 


respectively. In either case it holds that Er] k (v k ) = 0, Er] i (vi)r] k (v k ) = 0 for £ ^ k, and 
E\rj k (v k )\ 2 < (ll d)' rk) -> with a d independent of v k (and so the original v). Taking second 
moments and bounding the sup by another sum, we have that 



[n/r] k 

< d h 2 {k/nr) ^^(n — rt)~ 2 

k =1 1= 1 


< C 


// 



_ dy _ 

(1 -2/)(l+ I l°g(l — £/)| 1+ )' 


which is finite. 

Line two of (I3.46P requires more of the same. In fact, the sum over v\(8 k + S' k )v k (the 
hrst term in E' k {v k )) can be treated by the exact same L 2 bound just performed. Subsequent 
terms simply require taking higher moments, making use of the estimates (I3.24p from Lemma 

m 

Returning to Y^} k =i^ X k (v k ), the first ingredient is the following bound for chi-variables 
(appearing in the diagonals of the X k matrices): 


E [e Xxt ] < e AE »+^ for all A e M. 


(3.49) 
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This follows from the log-concavity of the Xt density, which yields a Log-Sobolev inequality. 
With this in hand, the well known Herbst’s argument gives (I3.49j) . see for example [IT] . 

Then, for X = a given Xk (and likewise G denoting the corresponding approximate 
Gaussian matrix Gk) and fixed v 6 S r ~ 1 , 


E [e AX(w) ] = E 




E 


X 7=jf(Gyt,J Wj .+Gh«. V t) 


< (Xi p(n-rk )\ Vi \ 4 p(n-rk)\ V i\ 2 \ V j \ 2 

< (n-rk) 


(3.50) 


Recall the definition of G in (13.231) from Lemma [TU the first overestimate above comes from 
using (13.49P in the first factor of (13.50]) . The outcomes is that 


e A Xk= i x k(t-’k) W/3)\ J2k=i n-rk j f or 777 , = 1,2,..., [n/r\ 


(3.51) 


is a supermartingale. Now Corollary 4.2 of [7] implies that: with probability one, 


lim sup 

n—>■ oo,x n ->-l 


f ST' l nx n 

\Xk=l 


nxn/r] 1 


n—rk 


iogio g (Er=r 


[nxn/r] 1 




(3.52) 


n—rk } 


with a constant c(/3) that can be explicitly worked out. Here x n indicates any sequence 
tending to one as n tends to infinity. This is more than is needed to complete the proof. 
Here we are applying Condition 4.1 of the reference to (13.5lj) with < h r (s) = | s 2 and B t = 

B n ,tn = 2 \J (2//5) Y^k=i'^ TiXrk w dh t n > 1. The obvious adaptation of the ideas there to 
the “triangular array” of variables here yields (I3.52() . □ 


3.6 Throwing out the tail 

We establish Step 3 of the proof of Theorem [III 
Lemma 18. For any large enough constant c, 


in probability. 


\\k ntr (x,y)\\ 2 dydx 0, 


0<x<y<l, J/> 1 — 


(3.53) 


Proof. As noted already, the scaled pre-factors \/n( 1 — are bounded by a tight 

random constant multiple of the identity. We can then consider the slightly simplified kernel 


k n A x ,y) = 


\/l — x 


[ny/r] 

T °* D F 


k= nx/r] 


\/l - X 


A n (x) A n (y ) 
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in place of k ntr (x,y). 

The proof comes down to constructing events kl n with probability tending to one on 
which 

ikw-'a.mii < n ( 1+c V i Sr ) ’ (3 - 54) 

nx<k<ny 

for the prescribed range of x, y and d is a numerical constant, independent of /c, n or c. 
This is used in conjunction with Corollary [16] of the previous section which yields events Vt' n 
(there referred to as Vt n ) of probability close to one on which 

WA^xY 1 A n (y)\\ < d , for x,y < 1 - (3.55) 

e < and now another deterministic constant d. We are translating the tight random 
constant bound in that statement to a deterministic one on a sequence of nice events. The 
goal then is to show that on the disjoint union of Q n and Q' n the appraisal (13.53ft holds 
(for again k n>r ). That c be large enough is required exactly because we use (I3.55jl with 

PTO «i. 

So, with both (13.54)1 and (13.55)1 in place, write: 

rl rl rl-Sn p 1 

\\k n , r (x,y)\\ 2 dydx = / / \\k n , r ( x , y)\\ 2 dydx + / / \\k n , r (x, y)\\ 2 dydx. 

J 1 — 5 n J X J 0 J 1 — S n 

0<x<y<l, y>l— 5 n 

with S n = c^. 

n n 

For the first term we have the estimate on the right hand side of (13.541) , 



|| A n (x) 1 A n {y)\\ 2 < vr ~^\ with n n = d'n log log n, 

for large enough n and constant c". This gives 


>1 /■! 


T —Sn 


1 — X 


\\A n (x) 1 A n (y)\\~dydx < 


1 

x 


/thrift v 


= 2 


0n / e V KnX _1 


K n X 


v dydx 
1 


(3.56) 


y/ldnX 


dx < 


which tends to zero like 1/n 1- . The final inequality rests on the fact that is increasing 
for large x, while e^ < 1 + yfx + O(x) for moderate values of x. 

As to the second term we have: 

rl — Sn /•! 1 


o>/ Kn^ri 


Kr 


1 — S r 


1 — x 


114, (x)A n (y)\\-dydx 


(3.57) 


’1 — S n 


■1 


< 


1 — X 


\\ A n (x)A n (l - 5 n )\\ 2 dx / 114/(1 - 5 n )A n (y)\\ 2 dy 


< 2 dS, 


/f -l+a±i- 2 e 


'1 —Sn 


dx 


(1 -x) ! 


X 


' 1 —Sn 

Zi \Adddi 
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Here we used (I3.55[) in the first factor, and the same estimate just employed in (13.56ft in 
the second factor. Completing the remaining integral in gives an upper bound of order 
which decays like a small negative power of n as long as the L 2 condition 


q + 1 


— 2 e £>\/^ 


e < 


2r 


holds. 


It remains go back and construct the events Q n . Along with the standard Gaussian tail 

_ ^y2 ^2 

inequality in the form P (\g\ > 7 ) < — we will also need that, for any Xt random variable 

with t > 1 , 

P (|Xt - E Xt \ > SE Xt ) < 2 e"^ 2 . (3.58) 


This follows readily from the inequality (I3.49jl along with the bounds 


\j t ~ \ - - v ^- 

The lower estimate requiring t > 1, the upper estimate is Jensen’s inequality. Clearly 
is most useful for t j" 00 . For t = 0(1) we get by with 


(3.59) 

d338]) 


p (xt <S)< s\ 


(3.60) 


for small 5 > 0 and t > 1 (just integrate the density). 

Now, dropping indices for a moment, we write Ok = d + G and Dk = d + G where d and 
d are diagonal matrices of the independent x variables: 


di 


^-^X/3(n—rk—i+l): di 


1 

~^^Xp(n+a—r{k—l)—i+l)i 


leaving G, G to denote the strictly upper/lower matrices of independent Gaussians (gi G G, 
gi G G ). Denote by Tk the index set of all di,di,g t gi variables occuring in Ok and Dk+i■ 
Decomposing as in, 


OkD^ -I = (dd- 1 -I) + Gd- 1 - (/ + Gd-^dd-^I + Gd~ l )- l Gd~\ (3.61) 


we see that on the event 

D(J, k) = jl&l, \gi\ < 2 A/loglogn; di < y/n- k{ 1 + J); di,di > 


with 5 = Sk = 4 v/ 1 °g-P| n assumed less than there are the bounds 

x/n—k 1 

|| dd- 1 - /|| < 8 ^ 1 . ° glog n , and HGd" 1 !!, HGd- 1 !!, WGd^W 
yn — k 


y/n — k{ 1 — 


i&V k 


4\/log logn 
y/n — k 


That is to say, 

|| OkDk+i — J|| < d — , I =-, on fife £l(5k, k) over n — k e [16 loglogn, clogn], (3.62) 
y/n — k 
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with the lower bound on n — k providing a uniform estimate on the appearance of || (/ — 
Gd -1 ) -1 || < (1 — ||Gd -1 ||) -1 in the last term of (13.611) . Then by the Gaussian tail bounds, 
recall in particular (I3.58P coupled with (I3.59p . 


P G(<5fc, k) c , k e [n — clog n, n — 16 log log n]j (3.63) 

( g—2 log log n 

Vlog log n 

with a constant d = c'(r, c) on the right hand side. 

Next, for n — k of the same order as log logn, OkD A f 1 no longer concentrates well about 
the identity. Instead, we use that on n — k < 16 log logn, the typical Gaussian is now 
0(a/ log log log n) and on the event 


-f- 6 


—8 log logn 


n\k) = \ \gi \, \gi\ < 2 A/log log logn; di < 2 y/n - k,d t > -y/n-k 


ier k 


we can overestimate 

IIO^JI < (|| dd~ l \\ + || Gd-'W) ||(/ + Gd- 1 )- 1 || (3.64) 

^ , A/log log log n /log log log nV /2 ^ . A/log log n 

< c- . —x - - - <c— - 

\Jn — k \ n — k J y/n — k 

The second inequality is just an oo-norm bound on (/ + GcG 1 ) -1 , the entries of which are 
polynomial (of degree at most r) in the entries of Gd -1 . The last inequality holds for n big 
enough by adjusting the constant d. And similar to (13.631) . 


p|ufi'(fc) c , k e[n — 16 log logn, n — 

log log n 


< c" 


16 log log n 

—2 log log log n II \ ^ 


A/log log log n 


+ c" e_3fc ’ ( 3 - 65 ) 

k= < y 


which goes to zero for any 7 = 7 n — y 00 . Note here we are we are using the bound (13.581) 
with 5 = 1/2 (that is, f 1'(k) = O'(5 = 1 / 2 , k)), along with the fact E \t > \t f° r sufficiently 
large t, see again (I3.59P . This gets us down to n — k = 0(1). 

For the final stretch, we note that we have the same type of bound on as in 

(I3.64p on an event Vt"(k) defined similar to Vt'{k) except replacing the conditions on dj, di 
with di—r < yj2 log log logn and !^—r < 1/ \/log log logn. With k now ranging from n — 7 
to n, the adjustments needed in what corresponds to the second term of (13.651) are as follows. 
The probability of the upper deviations of the maximum di is bounded in the same manner 
as before by 0 ( 7 e~ logloglogn ) = O ( Iog / n ). For the probability of the lower deviations of the 
di we use (I3.60p to control this by 0((log log logn ) -1 / 2 Y^l=i k k ^ 2 ). Since 7 —> 00 as slowly 
as we want, both of these bounds tend to zero as n —> 00 . 

The advertised event can then be defined as the intersection(s) of G(5fc, k), dl'(k) and 
Q"(k) over their respective ranges. □ 







































4 Riccati correspondence 


We prove Theorems [3] and [4] providing hitting time and PDE descriptions of the spiked hard 
edge laws, as well as Corollary [5] which highlights the dependence of these descriptions on 
the “non-degeneracy” parameter. 


Proof (sketch) of Theorem 0 The argument to identify the q diffusion, with the appropriate 
starting point, is can be taken verbatim from Theorem 2 of [18] (with considerations for the 
boundary condition “at infinity” fixed by the subsequent erratum). 

To see the PDE connection is to note the (hyper-elliptic) differential operator 


C := 


d_ 

d/i 


2 2 cP_ 
(3° dc 2 


((a + |)c - c 2 - e-)|, 


introduced in (11.121) is simply the space-time generator for the joint process (x, q x ). Unique¬ 
ness follows from the standard martingale argument, starting from the observation that a 
solution to the backward equation CG = 0 produces a local martingale, x G(x,q x ). So 
then, with r the passage time to zero, G(/z, c) = E (^ c )G(t A t,q TAt ). If G is also bounded, 
the t —y oo limit can be taken under the expectation. The resulting equality, along with the 
boundary conditions (j 1 . 1 3 [) . force G to be the probability that the path never hits zero, that 
is, G = F. 

For the higher order distributions things are worked out inductively. Consider F\, and 
let K^cfdrj) denote the (improper) distribution of the passage time to —oo starting from 
time/place (/z, c), allowing for the expression 


Fi(/i,c) = F(/i,c) + 



K^ c (drj)F(rj,+oo). 


From the same basic theory just employed, k, which vanishes to the left of /z, also satisfies 
the backward equation Cn = 0. The claimed boundary condition follows by observing that, 
as c l —oo, ac n, c (dr}) tends to the unit mass at 77 = /z and F(rj, + 00 ) is continuous in 77 . □ 


Proof of Theorem [7} The approach mimics that for the one-dimensional case. Introduce the 
truncated operator <8 L on L 2 [A4, [0,L]] by 

rL rxAy r-L 

(®l)/(*)=/ / S(dz)M(dy)f(y) + C- 1 M(dy)f(y), 

Jo Jo Jo 

for / : [0, L\ (-)■ F r . Since S x and A4 X are continuous and norm bounded over [0, L\ any 
/ G L 2 [J\A , [0, L]\ which also satisfies the eigenvalue problem f(x) = A©l/(x) is readily seen 
to be C 1 and must also satisfy: 

/'(0) = Cf{ 0 ), fix) = \S X f L M(dy)f(y). (4.1) 

J X 
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This last condition holds entry-wise, so in particular f'(L) is the zero vector. The important 
point is that the same estimates on the Hilbert-Schmidt norm of k r (Lemma fl2ll shows that 
(55z, —> © almost surely in operator norm. In particular, the desired eigenvalue counting 
function is the L —> oo limit of that for 0^. 

The next step is to cast the condition that any fixed A is an eigenvalue (of 0 l) in terms of 
an initial value problem, Since the paths of A4 X are (almost surely) bounded over [0, L], any 
/ G L 2 [M] is also in L l [M], And if such an / satisfies the eigenvalue problem, it will then 
be continuous and, by repeating the argument, entry-wise differentiable. Taking derivatives 
we find that 

/'(*) = A S x f M(dy)f(y). (4.2) 

Following this by taking an Ito differential, we have the stochastic differential equation, 

df(x) = y/2A~ ] dB x A ] x f(x) + ^(a + r - 1 + ^)/'(x) - Ae -rx /(a;)^ dx, (4.3) 

which can be closed by including df(x) = f(x)dx along with the equation for A x . Note 
that A x is almost surely invertible for all x > 0. Here B x = 4= (F? X + H}) is the canonical 
j3 = 1,2,4 invariant (or “Dyson”) Brownian motion, and we used the equation for S x : 

dS x = \/2A~^ dB x A~ l + (a + r — 1 + ^ )S x dx. 

t-* 

The latter follows from Ito’s rule in the form dM~ x = —M~ 1 dM x M~ l +M~ 1 dM x M~ 1 dM x M~ 1 . 

The “twisting” of the Brownian increment by A\. in (I4.3j) is a complication particular to 
r > 1. Even without this, the fact that the noise appears multiplicatively seems to preclude 
reducing things directly to the classical (deterministic) matrix oscillation theory as was done 
for the multi-spiked soft edge in [4]. On the other hand, by the Markov property and (path- 
wise) uniqueness for the system x i —> (A x , f(x), f'(x)), it cannot happen that both / and /' 
vanish simultaneously. Further, by (14. 2 j) we see that f'{L) = 0 is precisely the condition for 
the predetermined A to be an eigenvalue of &l. 

To make computations it is convenient to lift things to a matrix system. Define F(x) G 
F rxr as the (unique) solution to 

dF'(x) = V2A~^dB x A x F'(x) + ^(a + r — 1 + —)F'(x) — A e~ rx F(x)^j dx , (4.4) 

dF(x) = F'(x)dx, 
with initial conditions 

F(0) = I, F'( 0) = C. 

Evolving this system forward (including the evolution of A x ) for fixed A, the outcome is that 

{A is an eigenvalue of 0 l} = {F\L) is singular}. 
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Note that the uniqueness argument used for (/, /') shows that F and F' cannot be singular 
at any common time. 

The standard Riccati procedure would be to now introduce P(x) = F\x)F~ 1 (x), which, 
away from the singular points of F(x), solves 

dP x = v / 2^4' t clB x A ] x P x + (r - 1 )P x dx + ip(P x )dx, P 0 = C. (4.5) 

ffere the definition of 

ip{P) = (a+ jj)P P 2 — Xe~ rx I r , 

from the statement of the theorem is simply lifted to act on matrices. While (14. 5 p recovers 
the r = 1 hard edge process (as it must), for r > 1 one must (again) include the auxiliary 
process A x to close the equation. Additionally, the hope would be to reduce the dimension 
of the system by translating the singular point count of F'[x ) over [0, L\ to an eigenvalue 
count of P x . But P x is not even say symmetric in law. 

The program is salvaged by introducing Q x = A\.P X A~, which satisfies 

dQ x = dB x Q x + Q x dB ] x + ftr(Q x )/ r - ^ Q x + (i - l)diag (Q x ) + dx, (4.6) 

with Qo = Pq = C, and diag(Qa;) denoting the diagonal matrix with u-entry (Q x )u. We will 
hold off on the derivation of (14.61) . One can immediately deduce that Q x ~ Q\ and hence 
Q x has r real and almost surely distinct eigenvalues q\,, q r . So, at least F'(x o) being 
singular for Xo G [0,L] is equivalent to some qi(x o) = 0. Furthermore, while (14. 6 p is not 
transparently invariant under “rotations” (except in the case /3 = 1), the eigenvalues form 
their own Markov process governed by 

dqi = -ynqidbj + ( V>(^) + q,, ^ ^ + <lk ) dx , (4.7) 

^ v U qi ~ qk J 

with bi,... ,b r (new) independent standard Brownian motions. 

The derivation of (14.71) is also put off to later, but taking it for granted the proof is fin¬ 
ished as follows. A standard calculation shows the interaction term q % precludes 

intersections among the qd s, see for example |R Proposition 4.3.5] If] So, choosing by con¬ 
vention q*(0) = q with Ci > C 2 > etc. the eigenvalues of C , the g* remain ordered until the 
possible explosion of q r to — oo. Next, the presence of the term —\e~ rx in each drift term 
(recall the definition of ijj) shows that g* are path-wise decreasing in A. This means that new 
zeros among any of q r on [0, L\ occur at the endpoint L and move from right to left. The 

5 This reference treats the standard Dyson Brownian motion with interaction term q -- qk > hut the 

considerations are basically identical. 
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same then holds for for the singular points of F', and so the eigenvalues of 0^. (Thus, after 
the fact we see that F cannot become singular before the first singular point of F'.) 

This establishes the correspondence between F^ and the probability of the appropriate 
number of vanishings among the family q ( . = {q r i}*=i,the substitution A = e~^ having 
the same effect as in dimension one of shifting the starting time from x — 0 to x — fi. The 
rest is also much the same as in dimension one: the partial differential operator appearing in 
(I1.15P is the space-time generator of (x, q x ), uniqueness is proved by an identical martingale 
argument, and so on. 

Finally we return to (14.6p and (14.71) . For the former, Ito’s formula gives 

( dA f x )P x A~ f = -dB\Q x + (-^ + )Q x dx , 

A\(dP x )A~^ = ( dB x + dBl)Q x + (r - 1 + L(Q x ))dx, 

AlP x (dA-') = Q x dBl + (j + P)Q x dx, 

and, at second order, 

(i dA ] x dP x )A~ ] = -dB\(dB x + dB\)Q x = -(r - 1 + )Q x dx , 
A\(dP x dA x ] ) + (dA\)P x (dA x ] ) = dB x Q x dB\ = tr (Q x )I r + (i— l)diag(Q x ), 

Summing the right hand sides produces (14.61) . 

For (14.7p we start by introducing some more notation, rewriting (14.6p as 

dQ x = dB x Q + QdB\ + F(Q x )dx, with F(Q) = i>(Q) + tr(Q) - ^Q + (j - l)diag(Q). 

jj jj 

The approach is classical, taking differentials throughout the spectral decomposition Q x = 
U},q x U x (using now q for the diagonal matrix of eigenvalues qi, g 2 • • • )• To be concrete, we 
carry things out for (3 = 2§| 

Set 

dB x = U x dB x Ul and dU x U\ = dN x + dG x . 

Note that B x is not simply a copy of dB (as it is for /3 = 1). The second definition is a 
Doob-Meyer decomposition, with N x a local martingale and G x of finite variation. Further, 

dN\ = — dN x , and dG x = —~dN x dNl, (4.8) 

6 This computation is more or less reproduced in m- It should be clear afterwards how everything goes 
through in the quaternion case (in the real case there are no subtleties — the diag(Qa;) term is not present 
and the Brownian motion is isotropic). 
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since d(U^U) = 0. With this in hand, suppressing the time index, an application of Ito’s 
formula in dq = d(UQU t) produces 

dq = dBq + q dB^ + UF(U^qU)U^ + (diVq + qdJV^) + (dGq + qdG^) (4.9) 
+ dNqdN^ + dNdBq + dNqdB* + dBqdN ] + qd&dNK 


And as martingale component of the right hand side must vanish off the diagonal it follows 


that _ 

q k dB ik + q t dB\ k q k dB ik + q4B ki 

ai\i k — -—- 

qi ~ qk qi ~ qk 

for i 7 ^ k. 

Next, write out (I4.9h on the diagonal with the help of (14.81) : 


(4.10) 


dqi = qi(dBu + dB^) + Fu( q, U) (4.11) 

- qi)dN ik dN ik 

k^i 

H - ^ + q k dBik)dNik H - ^ ^ (^QjdB k j -\- q k dBi k ^dN\ k . 

k^i k^i 


Here 

1 i r r 

Fu( q, U) = ip{qi) + E^+^-jEE qk\un\-\u ki \ 2 , ( 4 . 12 ) 

k^i k=1 1=1 

1 r 

= t%) + qk + 2 _ qk ) 

k^i k^i 1= 1 

and u a b are entires of U. 

We readily have that B„ + B\ t is equal in law to y/2 times a standard Brownian motion. 
The higher order multiplication table is more complicated: 

_ _ i r 

dB ik dB ik dB k idB k i 1 — ^ ^ l^al • 

z i=i 


and 

^ _ i r 

dB ik dB ki dB ik dB k i — \ ^ \ua\ • 

z t= l 

Invoking these facts along with (I4.10|) in (14.lip shows the final two lines of that equation are 
equal to 

y^(gi - gfc ) -1 (g?|d.B w | 2 + q£|d.B ? fc | 2 + qiq k (dB ik dB ki + dB ik dB ki )^j 

k^i 

=E fvv - 5 E(« - ®) E M 2 M 2 - 

k^i k^i t=l 
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When added to (I4.12p . the second term on the right hand side above cancels the non-invariant 
drift term in that expression. The derivation is completed by noticing that the first term 
here and second term in (14.121) combine to produce + q -ql ) = 9* Thk^i q—H • D 

Proof of Corollary The point is to show that for either of the processes (ll.lOj) or (11.141) . 
that, given a passage to zero has occurred, explosion to — oo occurs in finite time with 
probability one. 

For r = 1, or the (j 1.10 [) process, this was already noticed and proved in [20]. To summarize 
the main idea, first note that if q x hits zero it immediately becomes and stays negative for 
all time. Then, on the event r 0 < oo (with To that hitting time) the change of variables 
u x = log(— q x q - T0 ) yields a process on the whole line, satisfying 


du x 


2 

7 ? 


db x + [a + e 


+ e~ T °- x ~ Ux ]dx, 


r 0 e (—oo, oo). 


And given the sign change, one wants to show that u x hits +oo in finite time. But u x is path- 
wise lower bounded by the the homogeneous process v x defined by dv x = db x + [a + e Vx ]dx 
(run on the same Brownian motion). Feller’s test can now be applied to show r +00 (s) < oo 
with probability one as long as a > 0. 

For r = 1 the condition on a can be shown to be sharp, and we expect that for all r 
the a > 0 is the right condition. In any event, consider (jl. 14[) and notice again that if the 
lowest point q r vanishes at some x' < oo, then q r ^ x < 0 for all x > x'. With any other the 
other particles qi, i — 1 ,..., r — 1 , possibly still positive, the corresponding contribution to 
the drift q r x may also be positive, and so push q r in the “wrong” direction. On the 


other hand, in that case one has that 
q r hits zero it is bounded above by the process 


qr+qi 

Qr Qi 


< 1 . 


This means that, at the very least, after 


dz x 


2 

VP 


z x db + 



(r — 


!) + p)z* 



dx : 


ignoring shifts in the time-like variable. The proof is finished by noting that the r = 1 
argument can be applied to z x with a replaced by a — r + 1. □ 


5 Hard-to-soft transition 

By following the law(s) on paths induced by (II. 10)1 and (II. 14)1 in the a —> oo limit we show 
that (resealed) hard edge spiked laws degenerate to the soft edge spiked laws (Theorem [6]). 

Proof of Theorem 0 Since we are taking an a —» oo limit, Corollary [5] shows that from the 
start we can just track blow-ups (to —oo) for the hard-edge processes. 
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Taking first r = 1, introduce the time-space scaling, 

i](x) = a~ 2 ^q(a~ 2 ^x) — a 1//3 , (5.1) 

where q is defined by (11 TOR after taking a into 2 a: dq = qdb + ((2a + |)g — q 2 — e~ x )dx. 
With the hard-edge spectral parameter scaled as in a 2 + a 4 // 3 A, the process 77 is then started 
at time x 0 = — a 2 / 3 log(l — a _ 2 // 3 A) from p 0 = p(x 0 ) = a _2 / 3 (c — a). For this recall that the 
start time // for q is related to the spectral parameter via A = e -/ h 

Now the essential observation is the same as in the corresponding proof from [T 8 ]J for the 
limit of A(/3, 2a, oo): q hits — oo in finite time if and only if r) does. Hence, things come down 
to showing that we have the functional convergence of 77 = r](x, a) governed by 

drj x = ^(1 + a~ 1/3 rj x )db x + [a 2/3 (l - e ~ a ~ 213 ^ - rjl + J(a _1/3 + a~ 2/3 r} x )]dx (5.2) 

:= e(x, 77 , a)db x + f(x, r/, a)dx, 

with initial state, 

(x 0 (a), 770 (a)) = (—a 2 / 3 log(l — a - 2 / 3 A), a“ 2 ^ 3 c(a) — a 1 ^ 3 ), 
and a new Brownian motion b x , to the one-spike soft edge process (II. 16jl : 

dp x = =db + (x — p 2 )dx , (5.3) 

with initial state (A ,w). But now, 

e(x : 77 , a) —>• 1 , / {x, 77 , a) —?• x — p 2 , 

uniformly on compact sets in ( x , 77 ), while Xo = —a 2 / 3 log(l — a' 2 / 3 A) —> A and 770 = a _2 / 3 c — 
a 1 / 3 —w by assumption. This is sufficient to imply the convergence of the solutions of the 
martingale problems (P ^ 0 m w ), see for example Theorem 11.3.3 of [2S|E| In this setting, 
this is the same as convergence in law on paths in the uniform-on-compacts topology. 

From here the proof that a 2 / 3 — a - 4 / 3 A 0 (/3, 2a, c(a)) A 0 (/3, w) will follow from the fact 
that T-ooijia ) T_oo(p), where u denotes the passage time to b. Again, this is what was 
done in [18j in the case that c(a) = w = + 00 . If b were finite the convergence in law of the 
corresponding passage times is a consequence of the continuous mapping theorem, and so it 
is enough to show that 

lim Pa,™('T-oo(p) - r b (p) > e) V sup P X 0 :r? 0 (^ 00 ( 77 ) - 77 ,( 77 ) | > e) = 0 (5.4) 

&-S .-00 a>1 

7 Here we are tacitly assuming that the limiting starting point w is finite, if a~ 2 ^ 3 c — a F 3 —7 00 an 
additional approximation step is needed, but this has been carried out in [15] . 
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for any e > 0. Since this approximation step was swept under the carpet in [18] we take the 
opportunity here to provide some details. 

Consider the question for the p-diffusion (the same argument will work for rj, but is just 
more cumbersome as one needs to keep track of the parameter a —y oo). Make a further 
approximation and truncate the stopping times, replacing 77, (p) with r k (p) = 77, (p) Ak, noting 
that 77,(77) =>■ 77, (p) if and only if r 6 fc (r/) =>■ r b (p). Also introduce the time 7 k (p) = inf {a; > 
T b(p) '■ p ( x ) > — j)k}- Now take / to be a smooth function with f{x) = 1/x for x < —10 
(say). Ito’s lemma along with the optional stopping theorem gives: assuming k > 10 V 4/(3 
and dropping the dependence of the various random times on the path, 

r T -oo 4 

E/(p(r^ A 7b k )) = E f(p(r b k )) + E j ^ (-p~ 3 - xp ~ 2 + 1 )dx 

> E/(p(r‘)) + A - r‘]. 

Hence, since either p(r b ) = b or r b = r 6 °°, 

p (r k -oo A 7 - 7 > e) < - Hk k _ i/p f (p(rt) = ») • 

And since lim^.ooP( 7 * < k } p(r b ) = b) = 0, one has that 

lim P (r7 - t* > e) = lim P (r^ A 7 * - r‘ > e) = 0, 
which proves (15.4[) (for the p process). 

Moving to the joint convergence of A 0 , A 1; ..., we have to show that, for any fixed fc and 
a —?• oo 

Pxo(a),? 7 o(a) ( x l— t r /( x ) a ) explodes exactly k times) 

goes over into the Pa,^- probability of the same event for p. Recalling the shorthand n x7 (dy) = 
P Z) bijoo E dy ) for the passage time distribution of either process, the obvious thing to do is 
rewrite the above as in 

/ ^(xo(a),r]o(a)) (dXi)/t( 3;ii -)_ 00 ) [dX2) ■ ■ ■ ^(x k ,+oo) ({ + OC>}) ^ 

J X 0 <x\<---<x k 

and use the k — 0 result. In particular, we know, now emphasizing the processes involved, 
that rio ^ Jdx) tends weakly to k p x w ^(dx) in the sense of measures. Again this includes 

lim^oo r? 0 (a))({"*”O°}) = K \\ m,)({+ 00 })) which is all that was needed for the first step. 

The same argument gives convergence of the passage time distributions starting from fixed 
initial conditions, or that ^^ +00 \(dx) —> k p xq +00 ^(dx) weakly. Once more, technical issues 
associated with starting from +oo were dealt with in [18]: since +oo is an entrance point for 
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all 77 (a) and p, one can well approximate the distributions(s) by the same process started at 
some (large) finite point. 

Consider k — 1, which enough to explain what needs to be done. The issue is the conver¬ 
gence of the integral f p n (dx)f n (x ) to its intended limit in which: ( 1 ) the (sub-probability) 
measures p n are absolutely continuous and converge weakly to a (sub-probability) measure 
p and, ( 2 ) /„—>•/ pointwise where all f n and / are uniformly bounded (by one) and con¬ 
tinuous. (Here f n (x) = «J'i“| oo) ({+oo}) abd p n (dx) = a ), vo ( a ))(dx). Outside of {+ 00 } 

the measures p n and p posses no point masses.) By the tightness of p n you can cut down 
the domain of integration to a bounded interval I. After this one can invoke say Egorov’s 
theorem along with the absolute continuity of the sequence of measures to deal with the 
error term J 7 (/ n - f)p n (dx). 

When r > 1 the only novelty in establishing the process level convergence is the interac¬ 
tion terms. Following (15.11) we scale q as in 


Vi(x) = a 2/i qi(a 2/3 x) 


d/3 




and (15.21) is replaced by: for a given index i, 

dpi, x = ^(! + a ~ 1/3r li, x )dbi tX + [a 2/3 (l - e ~ ra ~ 2 ' Zx ) - r ? 2 + |(a _1/3 + a~ 

2 a 1 / 3 + T]i jX + rjj, x 


(a 1/3 + a 2/ \, x ) ^2 


Vi,x Vj,x 


2/3 Vi,x)]dx 

■dx. 


Again the diffusion coefficient and first drift coefficient converge uniformly on compact sets 
to the right objects — the only change is the shift from e~ a x to e~ ra ~ ' in the latter. And 
again the initial conditions settle down the claimed limits, more or less by design. While 
the final term does tend pointwise to Tr l v . ? which is the interaction term in (I1.17j) . the 
convergence is certainly not uniform over {rp > 1 J 2 > ''' > Vr}■ 

On the other hand, one does have convergence on paths up to the stopping time 


r e := {inf x : rp x - p j)X < e for any i < j}. 

And since r e —> 00 in probability as e — > 0 for both the rj and p processes, general consid¬ 
erations (see for instance [25j Lemma 11.1.1]) yield the convergence =>■ P p . Previously 
we had cited [U Proposition 4.3.5] for the non-crossing property of (a process related to) 77 , 
which proves the result for all /3 > 1. I 11 fact, for fd < 1 it is known that there is a crossing 
in finite time with positive probability, see | 6 [ Theorem 3.1] which again deals with a more 
standard Dyson Brownian motion. We may still have the path convergence between the hard 
and soft edge diffusions for /3 < 1, but this would require an additional argument. This is 
the point of Remark [7] 

□ 
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6 Fredholm approach for [3 = 2 


For completeness we include an explicit calculation of the limiting r-spiked distribution 
function for /3 = 2 (and integer a > 0). The reported limiting kernel has already appeared in 
[8]. The main point here is to detail the trace norm convergence for the associated integral 
operators, and so deduce the corresponding Fredholm determinant form of the distribution. 
We follow the original strategy of |2]. A second motivation is to carry out the hard-to- 
soft limit in this context, giving an analytic proof of Corollary [9] at least for f3 = 2 and 
a = 0,1,..., etcetera. 

Now let X be a complex n x (n + a) Gaussian matrix with diagonal covariance £ = 
£ r © I n - r and £ r = diag(cri, cr 2 ,..., oy), and let A min = A m j n (£ r ) denote the minimal 
eigenvalue of XXf Our starting point is a version of the following basic formula. 


Proposition 19. (After JU Proposition 2.1]) At finite n there is the identity 

P ^nA. m i n (on, ■ ■ • j 0"r) ^ detx,2[ 0) t](/ Kn) 

where K n = K n a x r is the kernel operator defined by 


K n (x,y) 



[ dw e~n z+ n w (zy+°fl-±;\ n - r f ] a^-w 

J Co 2m w-z \w) \l-jJ - z ' 


( 6 . 1 ) 


Here Co and C\,\/„ are any disjoint, simple, closed contours (oriented counter-clockwise) 
which enclose zero and {1, crf \ ■ ■ •, vf 1 }, respectively. 


It is easy to show that: 


Claim 20. Setting <Jk — for k = 1,... ,r in (16. 1 j) and taking n —>■ oo, K n converges in 
the trace norm of operators on L 2 [0,t\ t—?• L 2 [0,f] to K defined by 


K (x,y) 



k (x,y-z,w) 


where 


k (x,y,z,w) 


e -x{z-±)+y(w-±) 


W — Z 


( z V +a TT 1 — CkW 

v wJ 1 -c k z' 


The notation/conventions are repeated from above in that Co and C\j c are disjoint, simple, 
closed contours, with the latter enclosing cf 1 ,..., cf 1 . The contours C and C can be taken 
any nested disjoint contours surrounding the origin. 


The probabilistic content of Claim [20] is that 

lim P(nA min (ci/n,... ,c k /n ) > t) = det L 2 [ot] (I - K). 

n—yoo V J 


( 6 . 2 ) 


38 









The related result in [8] establishes the pointwise convergence of K n (x,y) to K (x,y) — atten¬ 
tion to error estimates being beside the point for the considerations there. More notably, the 
authors of [8] derive different formulations for the K kernel, showing for instance that it is a 
rank r perturbation of a conjugated Bessel kernel. This stands in analogy to the results of 
[2] which show that the r-spiked soft edge kernel is a rank r perturbation of the Airy kernel. 
Here we highlight the following consequence of the operator convergence to K. Compare part 
(b) of [21 Theorem 1.1], 

Corollary 21. Let K c denote kernel operator K in the case that c± — ■ ■ ■ — c r — c. Then, 


liindet L 2 [0>ct5] (/ - K c ) = det L 2 [0jt] (J - L), 

where L is the projection onto the span of the first r Laguerre functions of parameter a. In 
other words, the distributional limit of lA min (cI ) as n —> oo then c —> 0 (in that order) is 
that of the minimal eigenvalue of a (r, r + a) complex Wishart matrix. 


Proof of the claim. Unlike in [2] this requires no steepest descent argument, just a change 
of variables. First, replace outer integral over Cgi/o- by that over the union of two disjoint 
loops Ci UC 1/a , each still avoiding the contour C 0 . This splits K n into two terms. Consider 
that given by the f Ci ^ f c integral. Fubini plus two residue calculations show that 
this integral can be replaced by that say over J,, 1+s ^ J|tq=i+< 5 ' Tli f° r ^ < or an y 
choice of nested contours enclosing both 0 and 1, but not any of the erf) 1 . With our choice 
1 + S < miujcr^ 1 } (as each af 1 will scale like n , we can assume they are all greater that 
one). 

After this, put cr*, = — in 116. ID . substitute nz and nw for z and w. The contours of 
course move as well, but can be deformed again after the fact. Setting, 


k n (x,y; z,w ) 


e ~xz+yw 


W — Z 


r+a 


1 

nw 


nz 


n 

k =1 


C k l ~ W 

c k 1 - * ’ 


we have that, 

K„(x,y) = 


'Ci/, 


dz 
2iri 


dw 


'C 0 


2ni 


:k n (x,y\z,w) + 


dz 
2 tt i 


dw 


-I 2ni 


- k n (x,y;z,w) 


Here C\/ c and Co can be chosen as in the claim, and taken to be independent of n. Similarly, 
one can choose C = {\z\ = e} and C = {\z\ = e'}, for e' < e < min(l, mir^c^ 1 ))• 


Now the kernel k n only depends on n through 


l—u 

nw 

i—C 

nz 


, which converges uniformly to 


e 1 / 2 along the contours. And with \w — z\ bounded below for (z,w) restricted to either 
set of contours, one sees that K n (x,y) —> K (x,y) uniformly for (x,y) G [0,t] 2 . But then, 
Jo K n {x,x)dx ->• f*K(x,x)dx, and f* f* f(x)g(y)K n (x, y)dxdy ->• f* J* f(x)g(y)K(x, y)dxdy, 
for all f,g G L 2 [0,t\. This implies the trace norm convergence by [2U Theorem 2.20]. □ 
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Proof of Corollary\21\ After scaling we can work with the kernel operator 


cK c (cx, cy ) 


/ / + / / 

>C 1/c JCq J\z\=e J \w\=e/2 

, hi I 

/Ci J Co J\z\=e J\w\=e/ 2 . 


ck(cx , cy ; 2, w) 


dz dw 


2ni 2iri 

e —x(z-f)+y(w—-£) / £ \r+a ^ 1 — yj 
W — z \wJ 


1-Z 


(6.3) 


dz dw 
2ni 2ni ’ 


restricted to L 2 [ 0, t]. This is just another change simple change of variables — (z, w) i— y (-, —) 
— followed by a pair of contour deformations. Here we have chosen a particular C and C' in 
the second integral (and deformed back to this choice after the variable change). 

Next, by the same convergence criteria used for Theorem [20j and basically the same 
argument, we see that (16.31) tends in trace norm to its evaluation at c = 0. Letting e —> 0 
in that case shows that the second integral in fact drops out. That is, det^y[ 0)Ct ](/ — K c ) — > 
det L 2 [0it ](J - L), where 


L (x,y) 



r dw e ~ xz+yw 
l Co 2iri w — z 




w 

z 


r 


(6.4) 


At this point one may use integral representations for the Laguerre polynomials (for example, 
L k( x ) = fc 0 hilh(l + h +a ) to conclude that indeed L(x, y) = e~ x/ 2 e - yl2 Ylho L k( x ) L k(.y)- 
On the other hand, the original kernel K n , or (16. ID . corresponds to the (null) Wishart case 
if all the ay are set to one. That is, it must degenerate to the Laguerre projection kernel (or 
some conjugation thereof). But, undoing the scaling (replacing K n (x,y) with nK n (nx,ny)), 
then setting n = r and ay = 1 for k — 1,..., r, we recognize this is the same formula as the 
right hand side of (16. 4p . □ 
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